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Abstract 

In recent years directional multiscale transformations like the curvelet- 
or shearlet transformation have gained considerable attention. The reason 
for this is that these transforms are - unlike more traditional transforms 
like wavelets - able to efficiently handle data with features along edges. 
The main result in [27] confirming this property for shearlets is due to 
Kutyniok and Labate where it is shown that for very special functions 
ij) with frequency support in a compact conical wegde the decay rate of 
the shearlet coefficients of a tempered distribution / with respect to the 
shearlet tj) can resolve the Wavefront Set of /. We demonstrate that 
the same result can be verified under much weaker assumptions on ip, 
namely to possess sufficiently many anisotropic vanishing moments. We 
also show how to build frames for L 2 (R 2 ) from any such function. To 
prove our statements we develop a new approach based on an adaption of 
the Radon transform to the shearlet structure. 
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1 Introduction 

1.1 Previous work and notation 

One of the main themes of computational harmonic analysis is to represent 
a given function by its inner products with respect to a given set of 'atoms'. 
Prominent examples for such representations are the Gabor transform [22 or 
Wavelet Transforms |TS]. A common feature of most of these 'classical' repre- 
sentations is that they are isotropic, meaning that they treat every direction in 
space equally. For this reason they are not able to efficiently and accurately han- 
dle directional phenomena at different scales. This is certainly a large drawback 
considering the fact that in many applications, like for example image process- 
ing, the main information lies precisely in the directional features (edges). 

Until recently, one mostly had to resort to adaptive methods in order to 
capture such phenomena, but in 2003 Candes and Donoho managed a break- 
through in this problem by introducing the curvelet transform [5, 9J 1 10] which 
can be used to analyze functions defined on the plane IR 2 . 

The idea behind curvelets is to carve up the frequency (i.e. wavenumber- 
) domain into circular wedges each corresponding to a scale a and a direction 
9 € (0, 2ir] obeying the parabolic scaling relation 



Then, similarly to the Littlewood-Paley construction [ST], Candes and Donoho 
construct functions ^afifiix) := 7a,o,o(Re^), % € R 2 , Rg denoting the rotation 
matrix by 9, which have frequency support in the wedge corresponding to scale 
a and direction 9. To also localize in space they use the translates of these 
functions and arrive at the family of 'atoms' 



which correspond to scale a, location t and direction 9. In [9] they prove 
a representation formula which states that any i 2 -function / can be fully and 
stably recovered from its curvelet coefficients (f,"/ a ,e,t)- Moreover, they show 
that the decay rates of the curvelet coefficients give precise information on the 
directional behavior of / [S] . 

The idea of partitioning the plane into 'parabolic wegdes' is not new, by 
the way and already occurs in [171 131 j . although in a very different context. It 
is also featured in [33] under the name 'Second Dyadic Decomposition'. In [2 
various discrete curvelet-like frame constructions are given based on the same 
partition and the notion of decomposition spaces |19[ 118] . 

In the construction of the curvelet system one starts with the construction 
of the functions 7 a ,o,o which correspond to scale a. In particular, curvelets do 
not form an affine system, they are not generated from one (or finitely many) 
functions. 

Looking at wavelet transforms it might seem natural to define 7 a ,o,o = 
DA a ip, where Da u denotes the dilation operator with the expanding matrix 
A a = diag(a, a 1 / 2 ) (this choice of dilation matrix reflects the parabolic scaling 



width ~ length 2 . 



(1) 



Ja,e,t( x ) = 7a,o,o(R-e(^ - *)) , 



a e R+,6 £ (0,2tt], * e K 2 , 
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relation (Q])), and consider the system 7 a ,e,t = T t Dn e DA a ip, where we define for 
t e R 2 the translation operator T t : / i-» /(• - t) and for M e GL(2,R) the 
dilation operator D^i : f h-» det(A/) _1 / 2 /(M -1 -). This is what has essentially 
been done in [33] and goes by the name 'Hart Smith's transform' in [§]. Note 
that the mapping (a, 9, t) i— > T t D^ e DA a does not carry the useful structure of a 
group representation. This lack of structure makes it more difficult to constuct 
tight frames for this system. 

Remarkably, a group structure can be achieved if one replaces the rotation 
transforms 

/ sin(0) cos(0) \ 
9 \ cos(0) -sin(0) J 

by shear transforms defined by 

Ss := ( 1 ) 

and consider the system 

i) ast :=T t D S3 D Aa i) (2) 

for some shearlet ip [131 114] . The directional component is now encoded in 
the shear parameter s £ K. 

Definition 1.1. The Shearlet Transform of a function (or distribution) f de- 
fined on R 2 with respect to a function (or distribution) ijj defined on R 2 is defined 
as 

SH i) f{a,s,t):={f,ip ast ), a e R+, s e K, t e K 2 (3) 

and 

tPast{Xl,X 2 ) = a 1 ip{ , — 775-)- (4) 

As stated before, the system ^ can be regarded as the orbit of a function 
Tp under the action of a unitary representation of a group, the so-called shearlet 
group § = (K+ x R x M 2 , o) with 

(a, s, t) o (a, s, t) = (ad, s + S\fa, t + S s A s t). 

Using this structure one can apply the machinery of square integrable group 
representations to obtain representation formulas for L 2 -functions. We denote 
by / the Fourier transform of a function / € L 2 (R 2 ) defined by 

/M = / f{x)e- 2 ™™dx. 

Definition 1.2. A function ip is admissible if and only if 

CV= / H^ji-dw= f I \(D Ss D A ^) A (Z)\ 2 a- 3 / 2 dads < 00. (5) 

Jr 2 m I 

Admissible functions are called shearlets. 
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The equivalence of the second and third term in ([S]) can be seen by using 
the substitution u>(a,s) = D s -tD a -t£. The reason why this works (meaning 
that the integral is independent of £) is just the group structure of the shearlet 
group and the fact that the left Haar measure is given by a~ 3 dadsdt [T3j. The 
following notion will often be used in the sequel: 

Definition 1.3. We shall say that f has n-vanishing moments in x\-direction 
if 

I^MI 2 



M 2n 



■du> < oo. (6) 



Remark 1.4. The reason for the teminology 'vanishing moments' for the con- 
dition (0) is as follows: if we assume sufficient spatial decay ofij), the condition 
is (almost) equivalent to 



t\ip(x\, X2)dx\ = for all X2 S R, 



k < n 



see e.g. \29] for similar statements related to wavelets. 

With some Fourier analysis one can show the following: 

Theorem 1.5. Ifip be an admissible shearlet, then for all f £ L 2 (R 2 ) we have 
the representation formula 

||/||1= f \S-H^{f){a,s,t)\ 2 a- Z dadsdt, (7) 

J{a.s,t)esn 

From the admissibility condition we see that shearlets exist in abundance. In 
fact all they need to satisfy is to have one vanishing moment in the xi-direction 
or equivalently to be a partial derivative in xi-direction of a square integrable 
function. We denote by -ff( nii „ 2 )(R 2 ) the Sobolev space defined by 

ff (ni ,„ 2 )(M 2 ) := {/ € L 2 (R 2 ) : t^)" 1 (^)" 2 / € i 2 (M 2 )}. 

Theorem 1.6. Let 6 be in H( n ^(R 2 ) with 6(0) ^ 0. Then the function 

^( x ) = (-ir(^-) n e(x) (8) 

is a continuous shearlet with n vanishing directional moments in x\-direction. 
Conversely, let ip be a continuous shearlet with n vanishing moments. Then ip 
can be written in the form (0) with a function 9 £ H^ n ^(M. 2 ) . 

Proof. This is an easy exercise. □ 

The representation has one drawback: while in the curvelet transform the 
directional parameter 9 ranges over a compact set, we now need to consider all 
shear directions s € K. It is also easy to see that as s gets large the directional 
resolution gets denser and denser, so that the parameter s does not distribute 
the directions uniformly. In order to avoid this dependence on the choice of the 
x\ and ^-coordinates, Labate et. al. proposed the following construction called 
'shearlets on the cone' in [251: 
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Let / £ L 2 (R 2 ). Then decompose / = P v f + P c f + Pc f where P v is a 
frequency projection onto [— 2,2] 2 , P-p ontoC = {£ : > 2, |g| < 1} and Pc 

onto C = {£ : IC2I > 2, ||^-| < 1}. It is well-known from microlocal analysis 
that directional singularities with slope > 1 manifest themselves as slow decay 
in C and singularities with slope < 1 as slow decay in C v . Therefore Pcf can 
be seen as the part of / containing singularities with slope > 1, Pcf as the 
part of / containing singularities with slope < 1 and Ppf as a smooth low-pass 
approximation of /. 

The 'shearlets on the cone'-construction essentialljQ goes as follows, denoting 
by the space of rapidly decaying C°°-functions: 

Let V be defined via = ^i(£i)V> 2 (|-) where i> x £ Cg°(R) is a wavelet 
which has frequency support supp ipi C [—2, —1/2] U [1/2, 2] and HV^Ib = 1 and 
^2 € Cq°(R) with supp *p 2 C [-1, 1], and </> 2 > on (-1, 1). 

Then with W a suitable window function one can show the following [2"T1 
Equation (3.7)]: 

II/II2 = / \(Pvf,T t W)\ 2 dt+ f f f \SH^{Pcf){a lS ,t)\ 2 a- s dad S dt 

JteM 2 Jt£K 2 J-2J0 

+ [ [I \SH^(Pcf)(a,s,t)\ 2 a- 3 dadsdt, for all / G L 2 {M(§} 

JtGR 2 .7-2 JO 

where we let ^"(£1,6) := £1) and SH^f(a,s,t) = (/.Cst)- 

More importantly, Kutyniok and Labate have shown in [37] that the shearlet 
coefficients in the representation ((5J) characterize the Wavefront Set WF(/) of 
a tempered distribution /, which roughly means the set of points t £ I 2 and 
directions s £ R along which / is not smooth at t. See the next section on more 
information regarding the Wavefront Set. The result [57J Theorem 5.1] is as 
follows: 

Theorem 1.7. Let ip be constructed according to the 'shearlets on the cone'- 
construction. Let f be a tempered distribution and T> = Z>i U T>i, where T>\ — 
{(to, so) G !R 2 x [— 1, 1] : for (s, t) in a neighborhood U of (sq, to), \SJi^f{a, s,t)\ = 
0(a k ) for all k £ N, with the implied constant uniform over U} and T>% = 
{{to, so) G K 2 x (1, 00] : for (1/s, t) in a neighbourhood U o/(so, to); \S1-C^f(a, s,t)\ = 
0(a k ) for all k £ N, with the implied constant uniform over U}. Then 

WF{ff = V. 

We would like to mention that curvelets and shearlets are not the only con- 
tinuous transforms capable of 'resolving the Wavefront Set', see for example 
[T2l 1521 I3"3"] . From a computational point of view curvelets and shearlets have 
probably received the greatest attention recently (although many ideas are al- 
ready contained in [33] ). 

A Word on Notation. We shall use the symbol | ■ | indiscriminately for 
the absolute value on K, R 2 and C. We usually denote vectors in M 2 by x, t, £, ui 
and their elements by x±, x%, t\, t%, . . . . In general it should always be clear to 
which space a variable belongs. The symbol |j • || is reserved for various function 
space and operator norms. 

'we have modified the construction a bit in order to remain closer to our later results, for 
the precise construction we refer to I27| 
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1.2 Contributions 



The motivation of the present work is the fact that on the one hand, by Theo- 
rem [Tj)] we know that shearlets exist in abundance, but on the other hand the 
'shearlet on the cone'-construction described above is very specific. In partic- 
ular it requires ip to have compact frequency support and thus infinite spacial 
support which might be undesirable for some applications (the same caveat is 
valid for curvelets). So the question we would like to answer is: 'what is really 
needed for a function t/j so that a representation similar to © and a result like 
Theorem II .71 are valid?'. It turns out that there are no restrictions on ip besides 
the obvious ones, i.e. vanishing moments in the xi-direction: 

In Section [S] we show (among other things) the following theorem: 

Theorem 1.8. Let ip be a Schwartz function with infinitely many vanishing 
moments in x\- direction. Let f be a tempered distribution and D = Dx U Z>2, 
where V x = {(t , s ) el 2 x [-1, 1] : for (s, t) in a neighbourhood U of (so,io)> 
\STL^f(a 1 s,t)\ = 0(a k ) for all k £ N, with the implied constant uniform over U} 
and T>2 = {(^Oj^o) £l 2 x (l,oo] : for (l/s,t) in a neighbourhood U of (so, to), 
\S7i^f(a,s,i)\ = 0(a k ) for all k S N, with the implied constant uniform over 
U}. Then 

WF(f) c = V. 

We also show an analogous result if ip has only finitely many vanishing 
moments in xi-direction. 

In addition, in Section [4] we show that for essentially any shearlet a repro- 
duction formula similar to ([9]) holds. To show this we shall use the notion of 
continuous frames. 

These results provide a substantial generalization to the previous results in 
P7] . In particular they allow for the first time to also use compactly supported 
functions if>, for example tensor-product wavelets, in the analysis. 

We would like to remark that, while the methods of proof of the results 
PTl [§] are quite similar to each other, our proofs follow different arguments. 
As a main tool we use a version of the Radon transform which is specifically 
adapted to the shear operation. 

Outline. The outline is as follows. In the next section, Section^ we intro- 
duce the notion of TV- Wavefront set. We also introduce the Radon transform 
which serves as an extremely convenient tool for our analysis. Then, in Section 
[5] we show a direct theorem stating that S7i^f(a,s,t) has fast decay in a if / 
is smooth in t and direction s. As already noted, in Section [¥] we derive sev- 
eral representation theorems for L 2 -functions based on the notion of continuous 
frames. In Section [5] we show an inverse theorem stating that / is smooth in t 
and direction s if STL^f(a, s,t) has fast decay in a. Section [6] summarizes these 
results and contains some theorems concerning the resolution of the Wavefront 
Set. 

2 The Wavefront set 

In this section we introduce and explain the notion of the Wavefront Set of a 
tempered distribution / which has its roots in the analysis of the propagation 
of singularities of partial differential equations , see also |3S] for a discussion 
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of various notions of Wavcfront Sets. It is in some sense easier to observe 
directional 'microlocal' phenomena in the Fourier domain and therefore the 
definition wc give for the Wavefront Set is formulated in terms of the Fourier 
transform of a localized version of /. We shall however see in the part on 
the Radon transform that the so-called Projection Slice Theorem gives us a 
tool to study microlocal behavior of / in the spacial domain - by studying the 
(univariate) regularity of the Radon transform of a localized version of /. 

2.1 Definition 

We now give the definition of the Wavefront Set of a tempered distribution. 

Definition 2.1. Let N £ R and f tempered distribution on R 2 . We say that 
x £ R 2 is an TV-regular point if there exists a neighbourhood U x of x such that 
Qtp £ C N , where $ is a smooth cutoff function with $ = 1 onU x . Furthermore, 
we call (x, A) an N- regular directed point if there exists a neighbourhood U x of 
x, a smooth cutoff function $ with $ = 1 on U x and a neighbourhood V\ of A 
such that 

(<S>.f) A (v) = + \v\r N ) for all n = ( m , m ) such that ^ £ V\. (10) 

The N- Wavefront Set WF N (f) is the complement of the set of N -regular di- 
rected points. The Wavefront Set WF(f) is defined as {J N>n WF N {f). 

It is not clear at first sight that the Wavefront Set according to the definition 
given above is well-defined, meaning that it is independent of the localization 
function $. Below we prove that it actually is: To keep things simple we shall 
mostly restrict ourselves to / € L 2 (R 2 ). 

Lemma 2.2 (Localizing does not enlarge the Wavefront Set). Let f £ L 2 (R 2 ) 
be a function such that 

f{rf) - 0((1 + \r]\)- N ) for all n = ( m ,n 2 ) such that ^ e V x 

where V\ is some open subset of R with A £ V\ . Let $ be a smooth function. 
Then there exists an open neighbourhood of A e R such that 

($/) A (77) = 0((1 + \ V \)- N ) for all V - ( m , m ) such that ^ e V{. 

In other words: If(x,X) is an N -regular point of f , it is also an N -regular point 
of$f. 

Proof. Let Ao £ V\ and e$ a unit vector in R 2 with slope Ao- We want to show 
that 

(a>.f) A (te )-o(ir Ar ). 

Let us compute 

($.f) A (teo) - $ * /(tco) = I f(te - OHOdC (11) 

JR 2 
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Since Ao € V\ and V\ is open, there exists 1 > 8 > such that teo + tB$ is still 
contained in the cone of all points £ with £, 2 /£,i € V\ for all £ G K and B$ the 
ball with radius S around the origin. Now we split the integral into 

m = / /(teo - e)*(Ode + / /(teo - e)*(e)d£ . 

J\Z\<6t J\£\>6t 

" v ' " v ' 

A B 

By assumption we know that for |£| < St we can estimate 

|/(£ e0 -0l = O(|te -er Ar )-O(|£|- JV ). 

Therefore it is easy to see that \A\ = 0(\t\~ N ). To estimate B we employ 
Cauchy-Schwarz and again the smoothness of 

\b\ < ! i/(£e - mHt,m < 11/H2 / \m\ 2 dt, 

< C||/||2 / (1 + |^|)- JV - 1 ^ = 0(|£|- JV ). 

□ 

Remark 2.3. While the above lemma assumes that f € L 2 (R 2 ), £/ie result is 
actually valid for any tempered distribution f . This can be shown as above by 
first assuming that f is a slowly growing function and using the Holder inequality 
instead of Cauchy-Schwartz. The case of a general tempered distrubution f is 
then handled by repeated partial integration. 

2.2 The Radon transform 

We introduce the Radon transform [16 . As we shall see later it will serve us as 
a valuable tool in the proofs of the later sections. 

Definition 2.4. The Radon transform of a function f is defined by 

TZf(u,s):= I f(u-sx 2 ,x 2 )dx 2 . (12) 

Observe that our definition of the Radon transform differs from the most 
common one which parametrizes the directions in terms of the angle and not 
the slope as we do. It turns out that our definition is particularity well-adapted 
to the mathematical structure of the shearlet transform. The next theorem 
already indicates that the Radon transform provides a useful tool in studying 
microfocal phenomena. 

Theorem 2.5 (Projection Slice Theorem). 

(rc/(u,«)) A M = /(w(M)). (13) 

For the convenience of the reader and because the proof is so short we show 
how this can be proved: 
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Proof. 



(Kf(u,s))*{uj) = f f f(u- sx 2 ,x 2 )e- 2m ^dx 2 du 

JR JR 

f(u, x 2 )e- 2m{ru+SX2)u dx 2 du = a)). 

□ 

By the Projection Slice Theorem, another way of stating that (x, A) is an 
iV-regular directed point is that 

{H^f{u,s)) A {uo) = 0(\u\~ N ) and s e y A , 

or in other words, that H<f>f(u, s) is sufficiently smooth in u around s = X. 

Since wavelets can deal perfectly well with univariate functions it is a natural 
idea to perform a wavelet analysis on the Radon transform of a bivariate function 
in order to study directional properties. This idea has been implemented in the 
ridgelet transform [7] and has led to the construction of curvelets and shearlets 
as we now briefly explain: 

The original ridgelet transform first partitions the function / to be studied 
into parts which are located in small but fixed spacial rectangles and then rep- 
resents the Radon transform of each of these parts via a wavelet transform. It 
can be shown that any / with only directional singularities along a curve with 
small curvature can be effectively represented in this way. However, if there are 
no restrictions posed on the shape of the singular set of /, a multiscale approach 
is required which leads to multiscale ridgelets [5J. It turns out that the whole 
set of multiscale ridgelets is too redundant and not frameable. A solution to 
this problem was given with the 'first generation curvelet transform' which first 
applies a bandpass filter to / and then analyzes the the frequency band ~ 2 J 
with multiscale ridgelets of scale ~ 2-?/ 2 - corresponding to the parabolic scal- 
ing relation. Realizing that the crucial thing is the relation (TT]) a much simpler 
curvelet construction has been given in [5] and this construction is what is now 
usually referred to as 'curvelets'. 

3 A direct theorem 

In this section we show that for an ./V-regular directed point (to, sq) of a func- 
tion / e L 2 (M. 2 ) the shear let coefficients with respect to any function with 
sufficiently many vanishing moments in the xi-direction decay quickly around 
t = to, s = sq. This fact is especially important for applications where sparse 
representations of objects with edges are desired [HUM]- We assume here that 
/ is square integrable since for general tempered distributions we would have 
to require tp to be a Schwartz function. The extension to general tempered 
distributions is however straightforward, see Remark I2.3I 

Theorem 3.1 (Direct Theorem). Assume that f is an L 2 (R 2 ) -function and that 
(to, so) is an N -regular directed point of f. Let tp € i?( .L)(M 2 ), ip G L 1 (IR 2 ) be 
a shearlet with M moments which satisfies a decay estimate of the form 

i,(x)=0((l + \x\)- p ). (14) 
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Then there exists a neighbourhood U(to) of to and V(so) of s such that for any 
1/2 < a < 1, t € U(to) and s £ V^(so) we /iaue t/ie decay estimate 

SH^fia, s, t) = 0(a- 3 / 4+p / 2 + a (1 -^ M + a -^ +aN + a^ 1 ' 2 ^) . (15) 

Proof. First we show that we can without loss of generality assume that / is 
already localized around to, i.e. / = $/ where $ is the cutoff function from the 
definition of the 7V-wavefront set which equals 1 around to. To show this we 
prove that 

((1 - <&)/,</w t > = 0(a- 3 / 4+p / 2 ). (16) 

By definition we have 

1past{xi,X 2 ) = a J/ 1p{ , r-^-j. (17) 

a a L >^ 

Now we note that in computing the inner product (JT5J) we can assume that 
\x — t\ > 8 for some 8 > and t in a small neighbourhood U(t ) of to since 
(1 — $)/ = around f - By (|T3)) we estimate 

< ^d+i (; -„t; /2 ) 

< Ca- 3 / 4 (l + C{s)a- l ' 2 \x - t\)- p = 0(a- 3 / i+p / 2 \x - t\- p ) 

for \x — t\ > 8 and C(s) = (1 + 4 + (s 2 + x) 1/2 ) 1/2 (compare [271 Lemma 5.2]). 
We can now estimate 

((l-$)/,Vw) < Ca- 3 / 4+p / 2 / |x-t|- p |l-$( a;i ,ar 2 )||/( a;i) x 2 )|d : r 1 d : r 2 

«/|a:-t|><5 

= 0(a- 3 / 4+p / 2 ) (18) 

for i e J/(£o) ancl this is (|T5|) . Now, assuming that / = <&/ is localized, we go 
on to estimate the shearlet coefficients |(/, 4>ast)\- First note that the Fourier 
transform of ip a st is given by 

i>ast(0 = a 3 / 4 e- 2 ^^(aCi,a 1/2 (6 - »&)). (19) 
Now pick | < a < 1 and write 

\{f,i>ast)\ = \(f,$art)\<Q 3/i [ l/(ei,6)II^Ki,a 1/2 (6-^i))|rfe 

= o?' A I + a 3/4 / . (20) 

^ / v. / 



\- l \x-t\r p 



B 



Since ip possesses M moments in the x\ direction which means that ip(^i , £2) 
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£i $(£i>£ 2 ) with some 9 £ L 2 (R 2 ), we can estimate A as 

A = a 3 / 4 / |/(6,6)ll^«i,a 1/2 (6-^i))l^ 

J\il\<a-" 

= a 3/4 / a M iai M i/(a,e 2 )ii^(a6,a 1/2 (6-sei))i^ 

•/|£i|<a-° 

< a^-^a 3 / 4 / |/«i,6)||^i,a 1/2 (6-^i))|de 

7ieii<o-« 

< a (1 - Q)M (|/|, 1^1) < a (1 - Q)M ||/l! 2 ||^ t || 2 = a ( 1 - a ) M ||/|| 2 ||0|| 2 .(21) 



In order to estimate B we make the following substitution: 

d4i<%2 = a- z/2 diidt, 2 



a \ &\ - 



Then 



s = «- 3/4 /. , i/(^,-ei + a - 1/2 6)iiv3(a,| 2 )Me (22) 

Jliii >a - Q a a 



Now we shall use that (io, so) is a regular directed point of /. This means that 
there is a neighbourhood (sq — £, so + e) such that 

f(vuV2) <C(l + \ V \)- N forall^G ( So -e, So + e). (23) 

m 

Looking at (|2"2"|) we now consider 2a with 771 := 772 := ^£1 + a -1 / 2 ^ and 
^ > a~ a and get the estimate 

s - a"- 1 / 2 ^ < ^ = a + a-^&t < s + a- 1 ^. (24) 
By (|2"3"|) we have that 

l/A -li + a - 1/2 | 2 )l < C(l + l -^y N (25) 
a a a 

for s in a neighbourhood V(sq) of So, — > a~ a and |^| < e'a 1 / 2- " for some 
e' < e. Now we first split the integral B according to 

B = a~ 3/4 / iMi/^-fi + a- 1 / 2 ^)]^^,^)!^^ 

■/|li|/o>a-° a 

= a- 3 / 4 / + a ~ 3 / 4 / (26) 

J\il\/a>a-", || 2 |< e / a i/3-a ^|fl|/o>o-° £2 1 >e / a 1 / 2_a 

v v ' v v ' 

Bi B-2 

By (f2"5|) we can estimate -Bi according to 

Si <Ca QA, - 3/4 ||^||i (27) 
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It only remains to estimate B2. For this we will use the fact that -^-ni> <G L 2 (M 2 ). 

OX 2 

This implies that 



Putting together the estimates (fT8|) . (|21|) . I|27p and (|28|) we finally arrive at the 



4 Frames 

The goal of this section is to extend the range of validity for the representation 
formula © allowing general functions with vanishing moments. This is impor- 
tant in order to enlarge the scope of potentially useful shearlets. These results 
will also serve as a tool to prove the main theorem in Section [5J Note that the 
formula ||2J) is valid for any shearlet but this representation comes with some 
disadvantages such as dependence on the choice of coordinate axes. In what 
follows we present three different generalizations of © , each one with different 
advantages and disadvantages. In Theorem l4.4l we find a representation formula 
which is valid for any choice of a window function W but this representation 
is not tight (to be defined later) and it requires to project the data / onto a 
frequency cone prior to the analysis. Next, in Theorem 14.51 we eliminate the 
need to perform this projection at the cost of possibly worse frame constants. 
Finally, by carefully choosing the window function W we show in Theorem 14.91 
that we can actually wind up with tight frames if we perform a frequency pro- 
jection onto a cone prior to the analysis. First some definitions: We utilize the 
concept of continuous frames which has been introduced in [TJ . 

Definition 4.1. Let (X,fi) be a measure space with Radon measure \x and 
= { l fx)x£X a family of elements in some Hilbert space TL indexed by X . $ is 
called a frame in Ti if there exist constants A, B > such that 



A frame is called tight if A = B. 

If $ is a frame for TL, we call the operator T : TL — > L 2 (X,n),f 1— > 
((fTfx))xex the analysis operator and its adjoint T* the synthesis operator. 
The frame condition ((29)) assures the boundedness and lower-boundedness of 
the frame operator S : TL — > TL, f 1— ► T*Tf := J x (f,(p x )(p x dfi(x). Call the 
family $ = (<p x )xex with (p x := S~ 1 (p x the canonical dual frame of $. Then it 
is not hard to see that 




desired conclusion. 



□ 




for all f€H. 



(29) 
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(i) $ is a frame with frame constants -g , , and 

(ii) We have the reproducing formula 

/=/ {f,<t>x)<Pxdii{x) = {f,<Px)<Pxdfi(x), (30) 
Jxex Jxex 

where the equality holds at least in a weak sense. 

(iii) For tight frames with frame constant A the dual frame elements are given 

by fix = \<Px- 

The 'quality criterion' for frames in terms of efficiency and accuracy of the 
inversion of the frame operator is the ratio B/A, so in general it is desirable for 
a frame to be as close to tight as possible (such frames are also sometimes called 
'snug frames'). An example of a frame for the Hilbert space L 2 (R 2 ) would be 
the family 

* = (Vw)(a,s,t)eR + xRxR 2 

where the measure fi has Lebesque density a~ 3 and tp is & shearlct. In this case 
the frame constants are equal: A = B = and the canonical dual frame ^> 
equals The drawback of the representation of a function / e L 2 (R 2 ) arising 
from this frame is that the directions indexed by s are not equally distributed 
as s grows large. The goal of this section is therefore to restrict the parameters 
a, s to a compact interval. If the parameter a is restricted to a compact interval 
this means that low frequency phenomena will not be representable. Similarly, 
if s is restricted to a compact interval, this means that not all directions of 
singularities will be representable. We therefore shrink the Hilbert space L 2 (R 2 ) 
to the smaller space 

L 2 (C U , V ) V := {/ G L 2 (R 2 ) : supp / C C u , v }, 

where 

C u , v :={?GR 2 : |6I >«, 161 <v|£i|}- 

We shall always require u, v > 0. Once we succeed in constructing shearlet 
frames for the Hilbert space L 2 (Ci ; i) v , we are able to represent any signal 
/ G L 2 (R 2 ) as follows: Let 

C v u , v :={?eR 2 : 161 >«, I6I<«I6|}- 

Denote by Pc u ^ the orthogonal projection onto the space L 2 (C Ujt ,) v and by 
Pc» the orthogonal projection onto the space L 2 (C^ V ) V . We write Pc := Pc 1A , 
Pc^ '■— Pc\ 1 an d Pv for the orthogonal projections onto the spaces L 2 (C\^) V , 
i 2 (C^) v and £ 2 ([-l,l] 2 ) v , respectively. Then 

• Pef is analyzed using the shearlet ip, 

• Pc f is analyzed using the shearlet ip v (xi,X2) ■= il>(x2,xi), and 

• Pvf is analyzed with some low pass window. 
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In view of detecting singularities, the function Ppf is not interesting since 
it is analytic. We therefore restrict our attention to the detection of directional 
features of the function PqJ which we will henceforth simply denote by /. For 
the case of Pc f the analysis works analogous by reversing the variables. 

We will now analyze the structure of the frame operator on L 2 {C UtV ) v related 
to a system ^ = (Pc u , v ipast) a e[o,r], s€[-h,h], *6R 2 an( ^ the usual measure [i. For 
notational simplicity we shall simply write ip as t for the functions Pq u „ (ipast)- In 
fact it makes little difference if we consider tp ast since for any / G L 2 (C U , V ) V we 
have 

(f, Ipast) = {Pc u , J, Ipast) = (f,Pc u ,v1past)- 

Due to the shift-invariance of the system the frame operator possesses a 
particularly simple structure: 

Lemma 4.2. The frame operator S associated with the system ^ is a Fourier 
multiplier with the function 

A„,,(V)(0 := XC U J0 [ - s^))\ 2 a-^ 2 dads, 

Jo<a<T, s|<E 

Xc u , v denoting the characteristic function of C UiV . In particular, "if is a frame 
for L 2 (C UyV ) v if and only if there exist constants A, B > such that 

A < A UtV (iP) (£) < B for all £ G C UyV . 

Proof. Let f,g G L 2 {C). Then we compute 



((Sf) A ,g) = [ [ [ (f,ip ast )(gj ast )dsa- 3 dadt 
Jr 2 Jo J -h 

r 



{f , ipast) {g, ipast) dsa dadt 

r r s 



= [[[[[ /(e)^Ki,a 1/2 (6 - stimvMamiaVtfa- s m ))e 2 ^-^ d^dndsa^dadt 

Jr 2 Jo i-H Jl 2 Jl 2 

= f r I /W«i,a 1/2 (6 - ^i))s(0^(a£i,a 1/2 (6 - s^dHdsa-^da 
Jo J-eJr 2 

= [ hoWj&uAm)dt= [ KoW)^uAm)dc 

Jr 2 Jc 

We have used the fact that / R2 e 2 ^ lt ^~ 7 ^ dt — 5 V= £ in the sense of oscillatory 
integrals, or in the other words the Fourier inversion formula. Now it suffices 
to choose g to be some approximate identity in the convolution algebra L 1 (K 2 ), 
e.g. Gaussian kernels to conclude that 

(£/)» = A u ,„(V0M/M. 

The rest is trivial. □ 

We can now show the important result that we can build frames from al- 
most arbitrary functions with vanishing moments in iri-direction by letting the 
parameters a and t vary in a sufficiently large but finite interval [0, T], resp. 
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[—3,3]. First some notational conventions: Note that the Definition 11.31 makes 
also sense for general n 6 R. We say that a function tp has Fourier decay of 
order e in the second variable if V>(£i, £2) = 0(|^2| _e ) for large £2- Fourier decay 
in the first variable is defined in an analogous fashion. 

Theorem 4.3. let ip be a continuous shearlet with at least 1 + e > 1 directional 
moments, Fourier decay of order r > 1/2 in the second coordinate and Fourier 
decay of order \x > in the first variable. Then there exists T, 3 such that the 
family (Pc„,„ i>ast)ae[o,r), se[-E,E], teR 2 constitutes a frame for L 2 (C UiV ) v . 

Proof. The proof is given in the appendix. □ 

Using the previous results we can now show the following representation 
formula: 

Theorem 4.4 (Representation of L 2 -functions, frequency projection). Let ip 

be a shearlet such that (-? > c' ! /'ast)ae[o,r], se[-H,H], teR 2 constitutes a frame for 
L 2 (Ci ! i) v with frame constants A, B and let W be any function with 

A<\W(0\ 2 <B for all£e [-1,1] 2 . (31) 
Then we have the representation formula 

41/11! < / \(Pvf,T t W)\ 2 dt+ f f f \Sn^P c f(a,s,t)\ 2 a- 3 dadsdt 

JK 2 ./teR 2 Jse[-3,3] JaS[0,r] 

+ [ f f \SH^Pc»f(a,s,t)\ 2 a- 3 dadsdt<B\\f\\l (32) 

./teR 2 JsG[-H,H] Ja£[0T] 

for all f G L 2 (R 2 ). In every point of continuity x of f we have 

f(x) = [ (Pvf,T t W)T t P v Wdt+ [ [ [ SH^Pcf(a,s,t)P c ^ ast (x)a- 3 dadsdt 

jr 2 ./teR 2 Jse[-s,s\ Joe[o,r] 

+ [ [ I SH^Pc* f(a, s,t)P c ^ ast (x)a- 3 dadsdt (33) 

./teR 2 Jse[s,s] Jae[o,r] 

where W is any function with (W^(£)) A = W^(C) 1 f or a ^ £ £ [ — 1) I] 2 - 

Proof. First we note that the frame operator for the system (TtW)teR 2 ) for the 
Hilbert space L 2 {\— 1, l] 2 ) v is given by the Fourier multiplier with the func- 
tion x(£)l^(£)| 2 > where \ is the characteristic function of [— 1,1] 2 . The proof 
is standard. It follows that the dual frame is given by (T t W) te ^2 and W de- 
fined as above. It also follows from our assumptions that frame constants of 
the system (T t W) te R 2 ) are given by A, B. We already know that the sys- 
tems (Pc^ast) a e[o,r), «e[-H,s], teR 2 and (-PcV£ s t)ae[o,r], «g[-h,h], teR 2 constitute 
a frame for L 2 (Ci.i) v , i 2 (CJ' 1 ) v , respectively with frame constants A, B. Since 
the three space L 2 (Ci,i) v , L 2 (Cj / 1 ) v and L 2 ([— 1, l] 2 ) v are mutually orthogonal, 
(1321) follows. Equation (|33[) can be shown by polarization and convolution with 
an approximate identity (i.e. convolution with Gaussian kernels). □ 

Actually, there is no need to perform a frequency projection prior to the 
analysis and we can wind up with a truly local procedure: 
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Theorem 4.5 (Representation of L 2 -functions, no frequency projection). With 
the assumptions from Theorem \4-4\ the system 

(Vw) o e[0,r], »e[-B,H], tm 2 u C^ast)ae[o,r], »e[-B,B], tm 2 u ( T tW) teR 2 

constitutes a frame for L 2 (R 2 ) with frame constants A,3B. The associated 
frame operator is given by the Fourier multiplier with the function 

n(0 := A ,oo W(0 + A ,oo(r)(0 + l^(OI 2 - 
FKe /iaue i/ie representation (valid in every point of continuity of f) 

f{x) = ( {f,T t W)T t Wdt+ [ [ [ Sn^f(a,s,t)^ ast (x)a~ 3 dadsdt 

Jr 2 Jtsm? Jse[-E,s] iae[o,r] 

+ [[ I SH^f(a,s,t)i> ast (x)a- 3 dadsdt 7 (34) 

J t gR2 Jsgf-H^] «/ae[ci,r] 

Proof. The fact that the frame operator is given by multiplication with Q follows 
from Lemma 14.21 and Theorem 14.41 The estimate on the frame constants is 
immediate. We remark that the estimate ZB for the upper frame constant is 
rather crude. □ 

Remark 4.6. It seems to be a difficult (yet very important in our opinion) 
question if there exists a dual frame (t/w) to the system (ipast) which carries 
the same structure, meaning that 

ipast(xi,x 2 ) = a J/ ip{ , -pr-) 35) 

for some function if>. Note however that at least by choosing T and S large 
enough we can make the frame arbitarily close to tight. One possible approach 
in order to get tight frames is to enforce the function Q to be constant by choosing 
a suitable window function W . This can always be done but the crux is to show 
that this W is actually a useful window function in the sense that it has e.g. 
fast Fourier decay. 

The next lemma shows that if we restrict ourselves to data with frequency 
support in a cone, then the window functions that we get by enforcing a tight 
frame property are actually useful. We plan to study this approach further in 
future work. 

Lemma 4.7. Define W by 

Att,„(VO(0 + \W(0\ 2 = C*xc,.(0- (36) 

Assume that S > v, u > and that ip = -j^wQ has M anisotropic moments, 
Fourier decay of order L\ in the first variable and that 9 has Fourier decay of 
order L 2 in the second variable such that 

2M - 1/2 > L 2 > M > 1/2. (37) 
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Then 

\W{Z)\ 2 = o(\£\- 2min{Ll ' L2 - M) ). 

In particular if if) is sufficiently smooth and has sufficiently many vanishing 
moments then W is a smooth function (i.e. a useful window function). 

Proof. By definition we have 

\W(0\ 2 = Xc u JO( [ \^(a^,^2-s^))\ 2 a- 3 / 2 dads 

JaeM, |«|>5 

\4>(a^V^(^ - s^))\ 2 a- 3 / 2 dads). 

a>T, \s\<E 

We start by estimating the second integral using the Fourier decay in the first 
variable: 



3 ' 2 da 



f \^(a^,^2-^i))\ 2 a- 3/2 dads < 2Ec[ {a\^\)- 2Ll a 

Ja>V, |a|<S Ja>r 

= o(iar 2Li ) = o(i^r 2Li ) 

for all £ S C UiV . To estimate the other term we need the moment condi- 
tion and the decay in the second variable. We write -0(C) — an d 
£ = (£i,r£i), \r\ < v for £ £ C u>v . We start with the high frequency part: 

/ \^(a^,^2~s^))\ 2 a-^ 2 dads = f |<i| 2M |^«i, y/Zfa - \ 2 a~ 3 / 2 dads 

J a<l, |a|>E Ja<l, \s\>3 



< cf \at 1 \ 2M \Va~{&-sb)\- 2L2 a- 3 / 2 dads 

Ja<l, \s\>3 

= Cf a 2M - L ^ 2 \^\ 2M - 2L2 \r- S \- 2L2 dads 

J a<l, \s\>5 



>a<l, |s|>E 

we have used that |r — s\ is always strictly away from zero because v < 5. By 
assumption L2 = 2M — 1/2 — e for some e > 0. Hence we can estimate further 

■•■ = C|ar 2(i2 ~ M) / a- 1+ "\r-s\- 2L2 dads = 0(\^\- 2 ^- M '>). 

Ja<l, \s\>3 

The low- frequency part can simply be estimated as follows: 

$(afi,Vo(6- sa))| 2 a" 3/2 dads < CICil" 21,2 / fl- 3 / 2 ^ 2 |r - s|- 2L2 dads 

a>l, |s|>H J a>\, |s|>H 

= oar 2L2 ). 

Putting these estimates together proves the statement. □ 

Remark 4.8. We find it quite interesting how the smoothness and the moment 
conditions have to interact in the shearlet transform. This stands in contrast to 
the wavelet transform where only smoothness is required to arrive at a statement 
similar to Lemma \4.7\ Similarily, for wavelets to satisfy a direct theorem they 
are only required to satisfy moment conditions and essentially no smoothness. 
Shearlets, on the other hand, need to satisfy moment- and smoothness condi- 
tions. Of course, when we speak of smoothness related to wavelets, we mean 
conventional smoothness and not directional smoothness as measured via the 
Wavefront Set. 
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Using the previous lemma we are finally able to obtain tight frames for 

Theorem 4.9 (Representation of L 2 (C„ jt ,) v -functions, tight, frequency projec- 
tion). With the assumptions of Lemma \4-7\ and W defined as in h3di) . the system 

{Pc u , v ipast)ae[o,r), »e[-H,B], *gk 2 U (T t Pc UjV W) teR 2 

constitutes a tight frame for L 2 (C UiV ) with frame constant dp. We have the 
representation 

fix) = ±- [ (f,T t W)T t P c _Wdt 

+ -L f f [ SH i ,f{a,s,t)P Cu ^ast(x)a~ 3 dads4B&) 

W> Jtm 2 Jse[-s,s] Jae[o,r) 

The window function W satisfies the Fourier decay estimate from Lemma \4.7\ 

Proof. The frame operator is given as the Fourier multiplier with the function 
A m ,„(t/>)(£) + Xc n , v (€)\W(0\ 2 = Xc u ,A0 c 4>- lt follows that the frame operator 
is given by C^,Pc u v = C^I on L 2 (C Uj „) (I being the identity). □ 

Remark 4.10. For applications it is important to restrict the parameters a, s, t 
to a discrete set. This problem has already been studied in the general framework 
of continuous frames in 1201. Let us remark however that we see little hope that 
discretizing continuous frames will lead to discrete and non-bandlimited tight 
frames. The reason for this pessimism is that we are not aware of any useful 
tight frame construction for non-bandlimited wavelets which comes from dis- 
cretizing the continuous wavelet transform. To our knowledge the only useful 
and general method to construct wavelet tight frames ( or wavelet frames with 
wavelet duals) comes from Multiresolution Analysis constructions in combina- 
tion with the 'unitary extension principle ' We are currently pursuing the 
goal to generalize this construction to the shearlet setup t23jj . 

5 An inverse theorem 

In this section we prove a partial converse to Theorem 13.11 We show that if 
the shearlet coefficients of a function around (to,So) decay sufficiently fast in 
a, then (to,So) is a regular directed point. Before we can get to the proof we 
need some localization results. First we show that a frequency projection on a 
conical set retains the wavefront set. 

Lemma 5.1. The point (icb s o) is an N - regular directed point of g £ L 2 (R 2 ) 
if and only if (to, Sq) is an N - regular directed point of Pc u v g provided that 
s < v. 

Proof. This statement is not trivial as it might seem at first glance. We first 
show the 'only if'-part. Write Peg = g — Peg <?, where Pc^ u is the orthogonal 
projection onto the (closure of the) complement of C UjV . By assumption there 
exists a cutoff function $ supported around to such that 

($ 3 ) A (£) = 0(\£\- N ) for all £ 2 /£ x S (s -S,s + S) 
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for some 6 > 0. Clearly, since so G (— v,v) the point (£o>So) is an iV - regular 
point of Peg g. Therefore the same estimate as above also holds for (Peg „ff) A - 
The point is now that by Lemma l2~2"l the same estimate holds also for ($Pcg „<7) A - 
It follows that an analogous estimate holds for (<5>Pe u ,„g) A - This proves the 'only 
if'-part. For the proof of the 'if'-part we estimate (<&Pc£„<?) A with the same 
method as in the proof of Lemma l^^l and see that it is negligible for the decay 
properties of ($g) A restricted to a small cone around the line with slope So- d 

Remark 5.2. We do not know if the above lemma still holds true for sq = v. 

We recall the definition of the iV-th fractional derivative of a function / 
defined by 

The following lemma states some well-known results for fractional derivatives 
with N <£ N. 

Lemma 5.3. 

(f(-/a)) {N) (x)=a- N fW(x/a). (39) 
||(/<?) (Ar) ||2<C(||/W||4|| 5 ||4 + ||/||4||.9 (A,) ||4) N<1. (40) 



Proof. The first equation ((39)) is an easy exercise while equation (|40p is a special 
case of the so-called " fractional product rule" , cf . [TT] . □ 

The following lemma shows that in studying the regularity properties of / 
around to only the shearlet coefficients of / around to are relevant. We only 
formulate and prove it for JVsN, the general case can be shown using Lemma 
IP 



Lemma 5.4. Let f G L 2 (R 2 ), $ be a smooth bump function supported in a small 
neighbourhood V(to) of some to € K 2 and let U(to) be another neighbourhood of 
to with V(to) + SB C U(to) for some 5 > 0. Here, B denotes the unit disc in 
R 2 and + denotes the Minkowski sum of two sets. Consider the function 

g{x) = / (f,$ as t}<f>(x)ip ast (x)a- 3 dadsdt. (41) 

Jte(7(t ) c ,se[-H,H],ae[o,r] 

Then for all u, v 

9(0 = O(\£\- N ) |C| — oo, fori&C u , v (42) 

provided that 

9 j (x) := = 0(\x\- p ') with Pj/2 - 3/4 > j + 2, j = 0, ... ,N. 

(43) 

Proof. Consider the Radon transform 

I(u) :=1Zg(u,s) = / g(u - sx 2 ,x 2 )dx 2 , \s\ < v. 



We will show that G L X (R) which implies that 

(I W ) A H=/lH G L°°(R) 
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which by the projection slice theorem implies that with £ = w(l, s) 

13(01 = I'MI < l|J w lk w M- w < n^iu.wv/r+^ier^ 

which proves the statement. We now show that iW £ i^K). In fact, since 1 
is of compact support we only need to show that is bounded. 

J (JV) (u) = /" (/, i>ast)(-^) N I ${u- sx,x)ip as t{u- sx,x)dxdfj.(ast) 



a s I, 
N 



^ ( iO /" ^'^ ast ^ J (]fu) N 3 ®^ U ~ sa; ' a; )(^) : '^ s t(w - sx,x)dxdfi(> 
? ( j/ / ^ , ^ ast ^ a " : ' J (~I^ N J ®( u ~ sx > x ) e ist( u - sx,x)dxd^(as 



3=0 

where J ' = (afr)>- Ar gueing as in the localization part at the beginning of 
the proof of Theorem O we can show that |^ st (a;)| = 0(a- 3 / 4+p ^ 2 |a;-^^). 
Since (gj - ) has small support around to and the parameter t varies in a 

set which stays away from the support V(to) of (g§~) ,3 3>#-' we can estimate 

l(^) W " i *(»)^(»)l= (l(^) W " i *(*)l ^ P ^l*-^l" P, ) 

for t € U(to) c . Putting this estimate into (|4"4")l and using (j4"3"]) we arrive at the 
desired statement. □ 

We are finally in a position to prove the main result of this section, namely 
an inverse theorem. Again, we only formulate and prove it for N £ N, the 
extension to arbitrary N can be achieved via Lemma 15.31 

Theorem 5.5 (Inverse Theorem). Let f £ L 2 (C U , V ) V , oo > u, v > 0. Assume 
that there exist neighborhoods U(to) C R 2 of to and (sq — e, sq + e) C [— so,so] 
of So such that 

SHj,f(a,s,t) = 0{a K ) for all (s,t) £ (s - e,s + e) x U(t Q ) (45) 

wf/j the implied constant uniform over s and t. Then (to,Sa) is an N- regular 
directed point of f for all N with such that ip £ iJ(jv,L)(R 2 )> & > r ?/>(w) , 
Uh% eS T e l1 ( r2 ), j = 0,...,N and for some 1/2 < a < 1 

7V+2 < min (#-3/4, (l-a)(M+JV)-3/4, (a-l/2)L-3/4, 2(£ 2 -M+l), 2(L X +1)), 

(46) 

where M > 1 is the number of anisotropic moments ojip, L is the Fourier decay 
of ip in the second coordinate and £1,^2 are defined as in Lemma \4-. 7| such that 
holds. 

Proof. Choose T, S such that the system 
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constitutes a tight frame for L 2 (C U)V+K ) and v+n > s , with W chosen according 
to Lemma 14.71 The goal is to prove that for a localized version / of 

9=1 {I, i>ast)i>a,sta~^dadsdt (47) 

J t^M. 2 , s€(-a,B), oe(o,r) 

around to the Fourier transform of the function I(u) := lZf(u,So) decays of 
order |w| _Ar for \ui\ — > oo. This would prove (by the projection slice theorem) 
that (to, so) is a regular directed point of g. To show that this already implies 
that (to, so) is a regular directed point of /, we argue as follows: By Theorem 
14.91 we have the representation 

/ = ^rPc u V+K (.9 + / (/, T t W)T t Wdt) 
W Jtem 2 

It follows from Lemma [5.11 that (to,So) is an N- regular directed point of / if 
it is an TV-regular directed point of g + f t R2 (f,TtW)TtWdt. By Lemma [4.71 
and PS]) , {to, so) is an iV-regular point of J M2 (f,TtW)T t Wdt, and therefore 
we only need to verify regularity for g, which we will now do. 

First note that by Lemma 15.41 we can without loss of generality restrict the 
parameter t in the integral (|47|) to U(to) if we multiply by a suitable cutoff 
function Therefore we need to study the regularity properties of 

/=/ (f,^ as t)^ipasta~ 3 dadsdt, 
Jteu(ta), »e(-s,s), ae(o,r) 

where $ is supported in a small neighbourhood Vo(to) CC U(to) around to- Let 
us denote by I(u) the function 

I(u) := Tlf(u, s ) 

with 

Uf(u,s )= / (f,ipast)Tl®ipast{u,so)a~ 3 dadsdt, 
Jt£U(t ), se(-E,E), oe(o,r) 

and 

t>k i i \ -3/4 [ K ( \ ,u- s Q X2-t 1 +s(x 2 -h) x 2 s 
TZ$ip as t{u, s ) = a J/ J •&(u-soX2,x 2 )ip( ,— r j^)dx 2 - 

To prove our goal that I(lu) = 0(\u\- N ) we need to show that lu n I(lu) £ L°°(R) 
or the stronger statement that the fractional derivative I^ N ' of I defined by 

I (N) (u) ^(^fnu) := K/(«)) V (u) 

is in L\R). 

Unless stated otherwise in what follows the variables a,s,t are allowed to 
vary over the sets [0, T], [— S, S], U(t ), respectively. 

By the usual product rule and the definition of 1Z the quantity H^^Hi can 
be estimated by 
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C max J / \(f,ipast)\a J \Tl( n N _ } ipast) (u, s )\dfi(a, s,t)du 



gN-j qj 
-'I' — • 

j=o,...,N J K J a s t " 1 y dx 1 ~ 3 dx\ 

We only treat the case j = N, the other cases can be dealt with analogously. 

Assume that the function $ is zero outside a cube of sidelength 77 around 
to- Then it is easy to see that the support of I N is contained in the interval 
Iu = [(to)i — so(£o)2 — 2r/, (to)i — so(to)2 — 2r/\ and the integration variable X2 from 
the definition of 1Z can be restricted to the interval Ix = [(^0)2 — 7 h (^0)2 + v]> 
where to = ((^0)1,(^0)2)- We now separate the quantity 

QN 

I Iu Ja,s,t ux l 

with 



I (/, ^ast) la^l^C^^nv^-*) s o)\Ma, s, t)du =:A + B 



, ip ast ) \a N \TZ(^-^-^ip a st) (u, s )\d(j,(a, s, t)d% 



B = j j \(f,ip ast )\a N \n($j^ r ip ast ){u,s Q )\d[i(a,s,t)du. 



J Iu J a.,s£(s —c,s +E),t 

and 

IU - ''«.-/;•!" -'|/v(<i> ! 

'/[/ Jo,se[— H,H]\(so— E,s +e),* 

In order to estimate B we note that 

d N f d 

7£($— -^Vw)(w,so) = / $Trjj-4>ast{u-s Q X2,x 2 )dx2 = SHg$6 Xl+SoX2 - u (a, s,t), 
ox 1 J] x ux 1 

which means the shearlet transform of the delta distribution concentrated on 
the line x\ + sqx 2 — u — and localized by $ w.r.p. to the shearlet 9 = -§^wip 
with M + N moments. It is well known and easy to see that for s G (—3, £) \ 
[sq — e, so + e], the point (t, s) is an i?-regular directed point of 5 Xl +s x 2 -u for 
all R G N (in other words (t, s) is in the analytic Wavefront Set of 5 xl + SoX2 - u ), 
hence of $S Xl + S()X2 - u by Remark 12.31 By using the same arguments as in the 
proof of Theorem 13. II we see that for any 1/2 < a < 1 the estimate 

SH e $5 Xl+S0X2 - u (a, s, t) = ( a (i-)(M+iV)-3/4 + a -3/4 +(a -i/ 2)i) (48) 

holds with the implied constant uniform over t G U(to), s G (— S, H) \ [sq — 
s, sq + e]. The details are given in the appendix. Since by assumption there 
exists 1/2 < a < 1 such that 

N + 2 < min((l - a){M + N) — 3/4, (a - 1/2)L - 3/4), 

the expression B is bounded. 

In order to estimate A we use the fast decay of the shearlet coefficients of / 
around (to,so). By our assumptions on N, K, the coefficients (f, ipast) decay 
of order greater than a N+2+3 ^ 4 , and therefore A is bounded. □ 

Following this rather technical theorem we state an informal version of the 
inverse theorem: 

Theorem 5.6 (inverse theorem, informal version). Assume that ip has suffi- 
ciently many vanishing moments in the Xi direction, is sufficiently smooth and 
sufficiently well-localized in space. Assume that J^5[ ) holds for some f. Then 
(to, so) is an N -regular directed point of f for all N < K — 11/4. 
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6 Resolution of the Wavefront Set 



We now draw some conclusions to the previous results. They all follow immedi- 
ately from Theorems 13.11 and 15.51 In terms of resolving the TV- Wavefront Set we 
have the following result which we formulate only for / S L 2 (M. 2 ) for simplicity. 

Theorem 6.1 (Resolution of the Wavefront Set I). Let f € L 2 (R 2 ), N e K 
and e > 0. Then there exist P, M, L, L\, L<x such that for all functions ip € 
H(n.q) (K 2 ) with M vanishing moments in x\- direction, decay of order P towards 
infinity, C L in the second coordinate and L\,L^ as in Lemma \4-7\ we have the 
following result: write T> = T>\ U T>2, where T)\ = {(to, So) € R 2 x [— 1, 1] : for 
(s,t) in a neighbourhood U of(so,to), \SH.^f(a,s,t)\ = 0(a N ), with the implied 
constant uniform over U} and T>2 = {(^Oi s o) €E M 2 x (l,oo] : for (l/s,t) in a 
neighbourhood U of(so,to), \SH^vf(a,s,t)\ —0(a N ), with the implied constant 
uniform over U } . Then 

WF N+Z/4 +s ^y g v g WF N-U/4-e^y_ ^ 

The precise values of P,M,L,L\,Li can be read off Theorems \3.1\ and \5.5\ 

Proof. First we need to show that if \S7i^f(a,s,t)\ — 0(a N ) for \s\ < 1, then 
\SH.^Pc u „/(a, s,t)\ = 0(a N ) with some suitable cone with v > 1 and P, L, M 

large enough. To this end we can estimate the integral f Cc /(OV'o«*(0 < ^ = 
0(a N ) using precisely the same estimates as in the proof of Theorem 13 . 1 1 except 
for the estimate for B\ in that proof. This shows that indeed \SH^Pc u v f{a, s,t)\ — 
0(a N ). The reverse implication can also be shown using the same argument. 
We arrive at the following statement: 

\Sn^f(a,s,t)\ = 0(a N ) for |s| < 1 & \SH^P Cu ,J(a,s,t)\ = 0(a N ) 

for some cone with v > 1 and P,L,M large enough. The case s > 1 is similar. 
We also need the fact that for s < 1 the point (t, s) is an TV-regular point of 
Pc u „ / if and only if (t, s) is an iV-regular point of /which is Lemma 15.11 Now 
the statement follows directly from Theorems 13.11 and 15.61 □ 

Remark 6.2. Certainly it would be desirable to have an equality in \^9^ instead 
of the inclusions that we obtained. Despite considerable effort we were not able 
to obtain such a result and we are not sure if such a results holds at all. We 
believe that the reason for this is that our notion of Wavefront Set does not 
correspond to any useful microlocal function space. Usually Fourier decay of a 
function is not measured as in our definition of the N- Wavefront Set but rather 
in terms of a Sobolev (or more generally Besov) norm restricted to a cone like 
for instance 

I (i + ieiW(0i 2 <oo 

for all £ in a cone C\ around the direction A. Such measurements of the direc- 
tional Fourier decay of a (localized version of a) tempered distrubution lead to 
the concept of microlocal Sobolev spaces. In ^ microlocal Sobolev regularity 
has been fully characterized in terms of a curvelet square function for tempered 
distributions (assuming compact frequency support for the curvelets). In future 
work we would like to generalize these results to our setting. However, for our 
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present purpose, which is essentially to generalize the results in \2T] , the results 
of the previous sections are - as we shall see below in Theorem \6.4\ ~ just what 
we need. 

In [57], the authors considered the full Wavefront Set defined as 
WF(/) = |J WF W (/) 

AfGR 

and showed that for very specific choices of if> the decay rate of the shearlet 
transform coefficients determine WF/ where / is a tempered distribution. We 
show that the result is actually valid for any Schwartz functoin with infinitely 
many vanishing moments in xi-direction. First we show this for / with frequency 
support in a conical wedge. 

Theorem 6.3. Assume that ij) is a Schwartz test function with infinitely many 
vanishing moments in x\- direction. Then 

WF(f) — {(t, s) C M 2 x [—v, v] : STt^,f(a, s, t) does not decay rapidly locally around (t, s)} 
for any tempered distribution f with frequency support in C UfV for u,v > 0. 

Proof. We have already proved this result for / e L 2 (C u ^) y . Since V is a 
test function the generalization to tempered distributions follows easily by just 
repeating the same arguments. □ 

The following theorem has been proven in [27] for very specific choices of ip- 
As already stated, all we require is infinitely many vanishing moments in the 
xi-direction. 

Theorem 6.4 (Resolution of the Wavefront Set II). Let ip be a Schwartz func- 
tion with infinitely many vanishing moments in x\-direction. Let f be a tempered 
distribution and D = T>\ U T>2, where T>\ = {(to, so) € K 2 x [—1, 1] : for (s, t) in 
a neighbourhood U of (so,to), \STC t f,f(a,s,t)\ — 0(a k ) for all k € N, with the 
implied constant uniform over U} andT>2 = {(to, So) £ R 2 x (l,oo] : for (l/s,t) 
in a neighbourhood U of (so,to), \SH^ f(a, s,t)\ = 0(a k ) for all k £ N, with 
the implied constant uniform over U}. Then 

WF(f) c = V. 

Proof. This is an immediate consequence of Theorems 13.11 and 15.51 making the 
usual adaptions to handle general tempered distributions. □ 

7 Concluding remarks 

In the present paper we have shown that there is a lot of freedom in choosing 
a shearlet. In particular this allows us to define a shearlet decomposition with 
respect to a compactly supported shearlet. Be it a tensor product wavelet or 
any directional derivative of a smooth function, our results show that any such 
function possesses the ability to resolve directional features of a given function. 
For future work in this direction we would like to study the following: 
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• Tight Frames. As already suggested in Remark 14.61 it is possible to 
enformce a tight frame property by choosing an appropriate window func- 
tion. We showed that for functions with frequency support in a cone 
this leads to useful window functions. We want to pursue this approach 
in more detail with the goal of constructing compactly supported tight 
frames for L 2 (1R 2 ). 

• Discretization. We would like to discretize the frame construction using 
results in |20j and study computational issues like inversion of the frame 
operator and so on. 

• Sparsity. For curvelet and shearlet transforms there exist several results 
confirming sparsity of certain objects (images, FIOs) in the respective 
representation [231 HS1 El EH Hj ■ We would like to extend these results to 
our general setting. 

• Microlocal spaces. In [9] the authors proved a characterization of mi- 
crolocal Sobolev spaces via a curvelet square function. We would like to 
extend these results to our setting and also more general microlocal Besov 
spaces. 

• Compactly supported curvelets? Do functions with directional mo- 
ments also serve as curvelets, in other words, is it possible to construct 
frames from functions with anisotropic moments by replacing the shear 
transform by rotations? This has actually been done in [33], see also the 
section on 'Hart Smith's transformation' in [5]. 
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A Appendix 

Proof of Theorem \4-3\ To prove the theorem we need to estimate 

I $(o£i, - \ 2 a- z ' 2 dads (50) 

Jo<a<r, \s\<S 
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for (£1,^2) G C u<v . Notice that by assumption 

I \^ 1 ,^ 2 -s^))\ 2 a-^ 2 dads = C^ V(£i,6)e 

Ja£R + , seR 



and 



/ \e(a^,^2~^i))\ 2 a- 3/2 dad S = C e V(6,6) G 

JaGR+, sGR 



2GR+ 

where ^(£1,62) = £!#(<!;i, £2)- Since for all £ g C UiV we have |£ 2 | < an d 
therefore the estimate 

|6-^i|>(|s|-«)|6l (51) 

holds. Furthermore, since ^£> has Fourier decay of order r in the second coordi- 
nate, we have the decay estimate 

|fe&)|<C(l + |6| T ) _1 - (52) 
Now we estimate for any (5 > 

f |^(a£i,^fo-s£i))| 2 a- 3/2 dad S = v^T / |^(a, v^Ife-^i)) | 2 a' 3/2 ^ 

JaSR, s>H J 



I a<8 J a>6 
A B 

The first term can be estimated by 



a < s 2 ^^\ \e{d,^d/\^\(^- s ^))\ 2 d-^ 2 ddd s 



< ^ / VWwH", v^IiTfe - 8^)) \ 2 d^' 2 ddd S < c 9 5 2s . 

JaeR+.seR 

To estimate B we use (1521) and (E 



b = vUTT / |^(5,v^7^(6-^i))l 2 ^ 3/2 ^d S 

./a><S,|s|>E 

< cvllil/ (Va7jii||6-*ei|)~ aT a- 3 / 2 d5d fl 

./3><S,|s|>E 

< cy/\&\ [ (^d/\^\(\ s \-v)\^\y 2T d- 3 / 2 ddd S 

J a>S,\s\>S 

< q£il 1/2 - T / d-^ddf (\s\-vy 

Ja>S J\s\>E 



- 2T ds 



< Cu 1 ' 2 -- d- 3 / 2 ~ T dd (\ S \-v)- 2T ds^Q 

Ja>8 J\s\>S 

for 5 — > 00. We have used that |£i | > u. It is now clear that by choosing 6 small 
and 5 large enough, the integral J a£R , , >H |$(a£i, y/afa — s^x)) \ 2 a~ 3 / 2 dads can 
be made arbitrarily small uniformly in £ G C Ul „. Next we use the Fourier decay 
in the first variable to estimate 
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/ hKofl.VoCfr- s^))\ 2 a- 3/2 dads < [ (a^y^a^^dads 

Ja>F, \s\<S Ja>T, |s|<H 

< u-"2E [ a- 3/2 -^da -► 
in>r 

forT^oo. It follows that the integral J a>r | s i <n |V>(a£i, v / a(?2 — s?i)) | 2 a~ 3 / 2 (iads 
can also be made arbitrarily small uniformly in £ G C UlV by choosing T large 
enough. 

Now we are ready to finish our proof: 

(HDD = / ^(0^,^(6 -s£i))| 2 a~ 3/2 dacis 

JaGR, |s|>H 

- / h?(a£i, " | 2 ^ 3/2 ^. 

Ja>r, |s|<H 

By the previous discussion we can thus bound 

cv - v < O < 

uniformly for £ 6 C Uj „, any z/ > and T, E large enough. This concludes the 
statement. 

□ 

Lemma A.l (proof of (|48p ). Assume that ip has M vanishing moments in x\- 
direction and that uj^ ai iJ}(u;), (-^rtp) A G L 1 (R 2 ). With $ a smooth bump 
function we have for any 1/2 < a < 1 and s ^ sq that 

SH^S Xl + S0X2 - u (a, 8, t) = 0(o (l-a)W-3/4 + -3/4+(a-l/a)i;) . 

Proof. The proof is very similar to the proof of Theorem 13. 1[ the only difference 
is that we use the Holder inequality instead of Cauchy-Schwarz. We assume 
without loss of generality that u = 0. It is well-known that the Fourier transform 
of 5 Xl +s x2 is given by o~ Xl +i/ SoX2 - This implies that the tempered distrubution 
given by / := (§5 Xl+SoX2 - u ) A = <l * 5 Xl+1/soX2 is actually a bounded function. 
Now we seperate the integral 

SH^5 Xl+SaX2 - u {a, s,t) = $ast,f) 

into A, B\ and Bi in the same way as in the proof of Theorem l3.ll The estimate 
for B\ given there is also valid for the assumptions of the present lemma and we 
get B\ < a Q7V ~ 3 / 4 ||?/>| i for any N £N. Now we turn to an estimate for A. We 
write V>(£i,6) = £i M #(£i,6)- We use 6 G L^R 2 ), and / € L°°(R 2 ) to estimate 
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A as 

A = a 3 / 4 / |/(a,6)II^Ki,a 1/2 (6-^i))l^ 

•>|£i|<a- Q 

= « 3/4 / a M iai M i/(a,6)ii^Ki,« 1/2 (6-^i))Me 

"'l?l|<a- a 

< a^ 1 "^ 3 / 4 / |M,6)II^Ki,a 1/2 (6-^i))l^ 

< a M ( 4 -«) a - 3 / 4 / 1/(6,6)11^1,6)1^1^2 

< a (1 - Q)M - 3/4 ||e||i||/||oo. 

The estimate for B2 is similar and we omit it. 

□ 
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